# Multi-round dialogue optimization
Flashvl 2B Dynamic ISS
Apache-2.0
FlashVL is a new approach to optimizing vision-language models (VLMs) for real-time applications, aiming to achieve ultra-low latency and high throughput without sacrificing accuracy.
Image-to-Text
Transformers Supports Multiple Languages

F
FlashVL
117
2
Qwen3 4B INT8
Apache-2.0
A large language model with 4B parameters based on the Hugging Face transformers library, supporting functions such as text generation, thinking mode switching, tool invocation, and long text processing.
Large Language Model
Transformers

Q
zhiqing
1,904
1
Qwen3 0.6B Bf16
Apache-2.0
This is an MLX-format text generation model converted from Qwen/Qwen3-0.6B, supporting Chinese and English text generation tasks.
Large Language Model
Q
mlx-community
1,812
5
Qwen3 0.6B 8bit
Apache-2.0
Qwen3-0.6B-8bit is an 8-bit quantized version converted from Qwen/Qwen3-0.6B, a text generation model suitable for the MLX framework.
Large Language Model
Q
mlx-community
2,625
3
Google Gemma 3 27b It Qat GGUF
A quantized version based on Google Gemma 3's 27-billion parameter instruction-tuned model, generated using quantization-aware training (QAT) weights, supporting multiple quantization levels to meet different hardware requirements.
Large Language Model
G
bartowski
14.97k
31
Google Gemma 2 27b It AWQ
Gemma 2 27B IT is a 4-bit large language model based on AutoAWQ quantization, suitable for dialogue and instruction-following tasks.
Large Language Model
Safetensors
G
mbley
122
2
Tiny Random Llama 4
Apache-2.0
This is a lightweight version of Llama-4-Scout-17B-16E-Instruct, providing users with a more streamlined usage option.
Large Language Model
Transformers

T
llamafactory
1,736
0
Llama Xlam 2 8b Fc R Gguf
xLAM-2 is a large action model built on an advanced data synthesis and training pipeline. It excels in multi-round dialogue and tool usage, and can transform user intentions into executable actions.
Large Language Model
Transformers English

L
Salesforce
1,809
5
Gemma 3 4b It GGUF
Gemma 3.4B IT is a lightweight open-source large language model released by Google. Based on a parameter scale of 3.4B, it is suitable for dialogue and instruction following tasks.
Large Language Model
Transformers

G
tensorblock
395
0
Gemma 3 4b It GGUF
Gemma-3-4b-it is a lightweight language model released by Google, based on the Gemma architecture and suitable for text generation tasks.
Large Language Model
Transformers

G
gaianet
1,910
0
Llama 3.1 Swallow 70B Instruct V0.3
Llama 3.1 Swallow is a series of large language models built on Meta Llama 3.1. It enhances Japanese language capabilities through continuous pre-training while retaining English language capabilities.
Large Language Model
Transformers Supports Multiple Languages

L
tokyotech-llm
1,659
12
Llama 3.1 Swallow 8B Instruct V0.3
Llama 3.1 Swallow is a series of large language models built on Meta Llama 3.1. It enhances Japanese capabilities through continuous pre-training while retaining English capabilities.
Large Language Model
Transformers Supports Multiple Languages

L
tokyotech-llm
16.48k
20
Lumimaid V0.2 70B
Lumimaid 0.2 is a model based on Meta-Llama-3.1-70B-Instruct. Compared with version 0.1, there has been a huge improvement in the dataset. After data cleaning and optimization, it provides a better user experience.
Large Language Model
Transformers

L
NeverSleep
173
43
Llama 3.1 8B Instruct Abliterated Via Adapter
Eliminate the rejection response problem of the Llama-3.1-8B-Instruct model through LoRA technology
Large Language Model
Transformers

L
grimjim
3,173
30
Lumimaid V0.2 8B
Lumimaid 0.2 is a model optimized based on Meta-Llama-3.1-8B-Instruct. Its performance has been significantly improved through data cleaning and expansion, providing higher-quality text generation services.
Large Language Model
Transformers

L
NeverSleep
290
75
Mistral Nemo Instruct 2407 Awq
Mistral-Nemo-Instruct-2407 is a large language model fine-tuned for instructions based on the Mistral architecture, suitable for various natural language processing tasks.
Large Language Model
Transformers

M
casperhansen
5,322
11
Hermes 2 Theta Llama 3 8B 32k
Hermes-2 Θ Llama-3 8B is a powerful model that combines the advantages of Hermes 2 Pro and Meta's Llama-3 Instruct, and performs well in various tasks. It supports multiple prompt formats and function calls.
Large Language Model
Transformers English

H
OpenPipe
1,784
8
Karakuri Lm 70b Chat V0.1
Other
KARAKURI LM is a pre-trained language model built on Llama 2, which enhances Japanese processing capabilities and is further pre-trained on Japanese and multilingual corpora.
Large Language Model
Transformers Supports Multiple Languages

K
karakuri-ai
2,300
24
Leo Hessianai 7b Chat
The first open commercial-use German base language model built on Llama-2, focusing on German language processing
Large Language Model
Transformers Supports Multiple Languages

L
LeoLM
2,263
17
Featured Recommended AI Models